| html :: wordTagratio :: ºñÀ² HTML ¹®¼ÀÇ ÅäÅ« ¹üÀ§¿¡¼ ű׷Π´Ü¾îÀÇ ºñÀ²À» °áÁ¤ÇϱâÀ§ÇÑ ±âº» ¸ðµâ. |
Áö±Ý ´Ù¿î·Îµå |
html :: wordTagratio :: ºñÀ² ¼øÀ§ ¹× ¿ä¾à
- ƯÇã:
- Perl Artistic License
- °Ô½ÃÀÚ À̸§:
- Jean Tavernier
- °Ô½ÃÀÚ À¥»çÀÌÆ®:
- http://search.cpan.org/~jtaverni/
html :: wordTagratio :: ºñÀ² ű×
html :: wordTagratio :: ºñÀ² ¼³¸í
HTML ¹®¼ÀÇ ÅäÅ« ¹üÀ§¿¡¼ ű׷Π´Ü¾îÀÇ ºñÀ²À» °áÁ¤ÇϱâÀ§ÇÑ ±âº» ¸ðµâ. HTML :: WordTagRatio :: ºñÀ²Àº HTML ¹®¼ÀÇ ÅäÅ« ¹üÀ§¿¡¼ ű׿¡ ´ëÇÑ ´Ü¾îÀÇ ºñÀ²À» °áÁ¤ÇϱâÀ§ÇÑ ±âº» Perl ¸ðµâÀÔ´Ï´Ù. HTML :: WordTagRatio :: ºñÀ²À» »ç¿ëÇÕ´Ï´Ù. html :: content :: htmlTokenizer; HTML :: Content :: ContentExtractor¸¦ »ç¿ëÇϽʽÿÀ. MY $ Tokenizer = »õ HTML :: Content :: HtmlTokenizer ( 'ű×', '´Ü¾î'); ¿±â (html, "index.html"); MY $ DOC = JOIN ( "",); ´Ý±â (HTML); MY ($ word_count_arr_ref, $ tag_count_arr_ref, $ token_type_arr_ref, $ token_hash_ref) = $ tokenizer-> ÅäÅ« È ($ doc); MY $ RATIO = NEW HTML :: WORDTAGRATIO :: ºñÀ² (); $ value = $ RATIO-> RangeValue (0, @ $ word_count_arr_ref, $ tag_count_arr_ref, $ tag_count_arr_ref); html :: wordTagratio :: ºñÀ² ¹× ÆÄ»ý Ŭ·¡½º´Â ÁÖ¾îÁø ¹üÀ§ÀÇ Å±׿¡ ´Ü¾îÀÇ ºñÀ²À» °è»êÇÕ´Ï´Ù. ºñÀ²Àº ±âº» Ŭ·¡½ºÀÌ¸ç ¹üÀ§ÀÇ ´Ü¾î ÅäÅ« ¼ö¸¦ ¹ÝȯÇÕ´Ï´Ù. ¿ä±¸ »çÇ× : ¡¤ Perl.
html :: wordTagratio :: ºñÀ² °ü·Ã ¼ÒÇÁÆ®¿þ¾î